Song authorship attribution: a lyrics and rhyme based approach

نویسندگان

چکیده

Abstract In this work, we apply authorship attribution to a large-scale corpus of song lyrics. As sub-category poetry, lyrics embody cultural elements as well stylistic attributes that are not present in prose. We draw attention special characteristics such repetitive sound patterns and rhyme based structures can be key ownership, opportunities cannot employed for other types text tweets, emails, blog posts. first create new balanced, data set 12,000 from 120 different artists. propose CNN models on lyric set, order use structural information included the lyrics, similarly image classification. conduct experiments at character sub-word levels mostly reflect positional information. addition, phoneme level features, which intrinsically involve repetitions, rhyme, meter, represent unique verse-based textual compositions. attempt discover idiosyncratic features consequently author genre associations by working with variants architectures have been successfully used classification domains. Our architecture choice results particular focus residing neighboring regions, since CNNs fail apprehend long term dependencies. Finally, empirically evaluate our comparison findings previous test research

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Artist Attribution via Song Lyrics

Song lyrics, separated from the audio signal of their song, still contain a significant amount of information. Mood and meaning can still be conveyed effectively by a pure textual representation. There has even been somewhat successful previous work on genre classification from song lyrics[7]. Building on previous work, we seek to build an artist attribution system for song lyrics. This task is...

متن کامل

A Web-Based Self-training Approach for Authorship Attribution

As any other text categorization task, authorship attribution requires a large number of training examples. These examples, which are easily obtained for most of the tasks, are particularly difficult to obtain for this case. Based on this fact, in this paper we investigate the possibility of using Webbased text mining methods for the identification of the author of a given poem. In particular, ...

متن کامل

Rhyme and Style Features for Musical Genre Classification by Song Lyrics

How individuals perceive music is influenced by many different factors. The audible part of a piece of music, its sound, does for sure contribute, but is only one aspect to be taken into account. Cultural information influences how we experience music, as does the songs’ text and its sound. Next to symbolic and audio based music information retrieval, which focus on the sound of music, song lyr...

متن کامل

Authorship Attribution

Authorship attribution, the science of inferring characteristics of the author from the characteristics of documents written by that author, is a problem with a long history and a wide range of application. Recent work in “non-traditional” authorship attribution demonstrates the practicality of automatically analyzing documents based on authorial style, but the state of the art is confusing. An...

متن کامل

Maximal Repeats Enhance Substring-based Authorship Attribution

This article tackles the Authorship Attribution task according to the language independence issue. We propose an alternative of variable length character n-grams features in supervised methods: maximal repeats in strings. When character ngrams are by essence redundant, maximal repeats are a condensed way to represent any substring of a corpus. Our experiments show that the redundant aspect of n...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Digital Humanities

سال: 2022

ISSN: ['2524-7832', '2524-7840']

DOI: https://doi.org/10.1007/s42803-022-00050-x